Picture for Jingcheng Ni

Jingcheng Ni

InterSketch: An Interleaved Reasoning Model with Self-correcting Visual Sketch and Stepwise Reward

Add code
May 26, 2026
Viaarxiv icon

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture

Add code
May 12, 2026
Viaarxiv icon

VideoGPA: Distilling Geometry Priors for 3D-Consistent Video Generation

Add code
Jan 30, 2026
Viaarxiv icon

Coffee: Controllable Diffusion Fine-tuning

Add code
Nov 18, 2025
Figure 1 for Coffee: Controllable Diffusion Fine-tuning
Figure 2 for Coffee: Controllable Diffusion Fine-tuning
Figure 3 for Coffee: Controllable Diffusion Fine-tuning
Figure 4 for Coffee: Controllable Diffusion Fine-tuning
Viaarxiv icon

CVD-STORM: Cross-View Video Diffusion with Spatial-Temporal Reconstruction Model for Autonomous Driving

Add code
Oct 09, 2025
Viaarxiv icon

MaskGWM: A Generalizable Driving World Model with Video Mask Reconstruction

Add code
Feb 17, 2025
Viaarxiv icon

Efficient Interactive 3D Multi-Object Removal

Add code
Jan 30, 2025
Figure 1 for Efficient Interactive 3D Multi-Object Removal
Figure 2 for Efficient Interactive 3D Multi-Object Removal
Figure 3 for Efficient Interactive 3D Multi-Object Removal
Figure 4 for Efficient Interactive 3D Multi-Object Removal
Viaarxiv icon

UniMLVG: Unified Framework for Multi-view Long Video Generation with Comprehensive Control Capabilities for Autonomous Driving

Add code
Dec 06, 2024
Viaarxiv icon

HoloDrive: Holistic 2D-3D Multi-Modal Street Scene Generation for Autonomous Driving

Add code
Dec 03, 2024
Figure 1 for HoloDrive: Holistic 2D-3D Multi-Modal Street Scene Generation for Autonomous Driving
Figure 2 for HoloDrive: Holistic 2D-3D Multi-Modal Street Scene Generation for Autonomous Driving
Figure 3 for HoloDrive: Holistic 2D-3D Multi-Modal Street Scene Generation for Autonomous Driving
Figure 4 for HoloDrive: Holistic 2D-3D Multi-Modal Street Scene Generation for Autonomous Driving
Viaarxiv icon

PriorDiffusion: Leverage Language Prior in Diffusion Models for Monocular Depth Estimation

Add code
Nov 24, 2024
Figure 1 for PriorDiffusion: Leverage Language Prior in Diffusion Models for Monocular Depth Estimation
Figure 2 for PriorDiffusion: Leverage Language Prior in Diffusion Models for Monocular Depth Estimation
Figure 3 for PriorDiffusion: Leverage Language Prior in Diffusion Models for Monocular Depth Estimation
Figure 4 for PriorDiffusion: Leverage Language Prior in Diffusion Models for Monocular Depth Estimation
Viaarxiv icon